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Description 

REFORMATTING BINARY IMAGE DATA 
TO GENERATE SMALLER COMPRESSED 

IMAGE DATA SIZE 

Background of the Invention 
Field of Invention 

[0001] This invention is directed to reformatting binary image 

data. 
Related Art 

[0002] Binary image data means that each pixel in the image data 
is a discrete value "1" or "0" representing either a black 
pixel or a white pixel. Documents to be converted into bi- 
nary image data are typically continuous tone images. 
Continuous tone image means that the image is repre- 
sented by various shades of color or various shades of 
gray. Various processes can be used to convert continu- 
ous tone image into binary image data, and yet give an 
impression of continuous tone image. 



[0003] one such process, the halftoning process uses a mathe- 
matically stored screen pattern. The process systemati- 
cally compares each pixel's continuous tone value with the 
value of the screen. If the continuous tone value of the 
pixel is less dense than the screen value, a white pixel is 
produced. On the other hand, if the pixel value is more 
dense than the screen value, a black pixel is produced. 
Simply stated, the process converts a gray scale image or 
a color image into binary image data, but gives the im- 
pression of multiple gray levels by producing more white 
pixels for a less-dense area and more black pixels for a 
denser area. 

[0004] a specific type of halftoning process called error diffusion 
is an adaptive binarization process which has the property 
of preserving the local average gray level of the input 
continuous tone image. Specifically, error-diffusion prop- 
agates the error generated during binarization to neigh- 
boring pixels. 
Summary of the Disclosure 

[0005] Binary image data of converted documents occupy a very 
large memory space in an image data processing system. 
Large binary image data also consumes large bandwidth 
when transmitted, for example, over a local or wide area 



network, over an intranet, an extranet or the Internet, or 
other distributed networks. Therefore, compression 
schemes, such as CCITT G3/G4 are used to reduce the 
size of the binary image data. However, binary image data 
that are error-diffused do not compress well using CCITT 
G3/G4. 

[0006] Additionally, binary image data containing both high spa- 
tial frequency data and smoothly varying low spatial fre- 
quency data may not optimally compress well together. 
CCITT G3/G4 is a lossless compression scheme that com- 
presses well on image data having binary text and/or line 
art image data or, in general, any high spatial frequency 
image data containing sharp edges or color transitions. 
CCITT G3/G4 does not compress well on image data or, in 
general, image data having smoothly varying low-spatial 
frequencies such as gradients and/or natural pictorial 
data. Instead, a lossy compression scheme such as JPEG, 
compresses well on smoothly varying low-spatial fre- 
quency image data. 

[0007] jo accommodate for image data having different com- 
pression needs, a scheme such as the mixed raster con- 
tent (MRC) format may be used. Fig. 1 illustrates one ex- 
emplary embodiment of an MRC format. A document 100 



is segmented into a selector plane 110 (110') and a back- 
ground plane 120 (120'). Although two planes are shown 
in the figure, it could be N planes depending on the appli- 
cation. The selector plane 110 (110') contains high spatial 
frequency image data. The background plane 120 (120') 
contains smoothly varying low spatial frequency image 
data. Segmenting the document 100 into the selector 
plane 110 (110') and the background plane 120 (120') 
provides for different compression schemes to be applied 
to different planes. Thus, the most appropriate compres- 
sion scheme for each image data may be applied. 

[0008] This invention provides systems and methods for refor- 
matting binary image data into two or more planes. 

[0009] jhis invention provides systems and methods for seg- 
menting binary image data into text binary image data 
and non-text binary image data and creating two or more 
planes from the segmented binary image data. 

[0010] This invention provides systems and methods for convert- 
ing binary image data into gray scale image data and seg- 
menting the gray image data into two or more planes. 

[0011] This invention provides systems and methods for enhanc- 
ing binary image data and/or gray scale image data dur- 
ing segmentation into two or more planes. 



[0012] | n various exemplary embodiments, the systems include a 
converter to convert binary image data into gray scale im- 
age data. A segmentor segments gray scale image data 
into high spatial frequency image data in a first plane and 
low spatial frequency image data in a second plane. A first 
compressor compresses the high spatial frequency image 
data in the first plane and a second compressor com- 
presses the low spatial frequency image data in the sec- 
ond plane. 

[0013] | n various exemplary embodiments, the systems include a 
segmentor to segment binary image data into text binary 
image data in a first plane. A subtractor subtracts the text 
binary image data from binary image data to generate a 
non-text binary image data in a second plane. A converter 
converts non-text binary image data in the second plane 
into non-text gray scale image data. A first compressor 
compresses the text gray scale image data in the first 
plane and a second compressor compresses the non-text 
gray scale image in the second plane. 

[0014] These and other features and advantages of various ex- 
emplary embodiments of systems and methods according 
to this invention are described in, or are apparent from, 
the following detailed description of various exemplary 



embodiments of the systems and methods according to 

this invention. 
Brief Description of Drawings 

[0015] Various exemplary embodiments of systems and methods 
of this invention will be described in detail, with reference 
to the following figures, wherein: 

[0016] Fig. 1 illustrates one exemplary embodiment of an MRC 
format; 

[0017] pig. 2 illustrates an exemplary functional block diagram of 
a binary image data to MRC format generating system; 

[0018] Figs. 3-4 are flowcharts outlining an exemplary operation 
of the binary image data to gray scale image data con- 
verter module of Fig. 2; 

[0019] Fig. 5 is a flow chart outlining one exemplary method of 
reformatting binary image data to MRC format; 

[0020] Fig. 6 is another exemplary functional block diagram of a 
binary image data to MRC format generating system; 

[0021] Figs. 7-9 are flowcharts outlining an exemplary operation 
of a gray scale image data segmentor of Fig. 6; and 

[0022] Fig. 10 is a flow chart outlining one exemplary method of 

reformatting binary image data to MRC format. 
Detailed Description of Exemplary Embodiments 



[0023] Various exemplary embodiments of systems and methods 
according to this invention relate to scanned and/or 
printed color and/or black and white documents con- 
verted into binary image data. The systems and methods 
reformat binary image data to produce small, highly- 
compressed image data files that capture the original 
document content. According to various exemplary em- 
bodiments of systems and methods, output files are gen- 
erated in accordance with the MRC format, which includes 
both TIFF and PDF standards, as well as the PostScript 
standard. While various exemplary embodiments of sys- 
tems and methods refer to the MRC format, it should be 
appreciated that the techniques described herein may be 
implemented in other formats to achieve a desired result. 

[0024] pig. 2 illustrates a binary image data to MRC format gen- 
erating system 200 according to one embodiment of the 
invention. The binary image data to MRC format generat- 
ing system 200 may be a xerographic printer, a marking 
device, a multi-functional device, a digital photocopier, an 
ink-jet printer, a scanner, a facsimile machine, etc., suit- 
able for storing and/or receiving and/or transmitting im- 
age data. The binary image data to MRC format generat- 
ing system 200 includes a binary text segmentation mod- 



ule 210, a speckle removal module 220, a subtraction 
module 230, a binary image data to gray scale image data 
converter module 240, an optional scale module 250, an 
optional enhancement filter module 260, a lossless com- 
pression module 270, a lossy compression module 280 
and a wrapper module 290. 
[0025] The binary text segmentation module 210 receives binary 
image data from a data source. The binary text segmenta- 
tion module 210 extracts binary text image data from the 
binary image data. The speckle removal module 220 re- 
ceives the binary text image data. The binary text image 
data may contain speckle noise due to halftoning tech- 
niques such as error diffusion. The speckle removal mod- 
ule 220 removes speckle noise from the binary text image 
data to generate an enhanced binary text image data. The 
enhanced binary text image data becomes the selector 
plane 110 of the MRC format. The lossless compression 
module 270 receives the enhanced binary text image data. 
The lossless compression module 270 compresses the 
enhanced binary text image data, for example, using 
CCITT G3/G4 compression scheme. Alternatively a token- 
based compression could also be applied to the text im- 
age data, for example using JBIG2 compression. The com- 



pressed enhanced binary text image data is sent to the 
wrapper module 290. 
[0026] | n parallel, the enhanced binary text image data is sent to 
the subtraction module 230. The subtract module 230 
subtracts the enhanced binary text image data from the 
binary image data to generate non-text binary image 
data. The binary image data to gray scale image data con- 
verter module 240 receives the non-text binary image 
data. The binary image data to gray scale image data con- 
verter module 240 converts the non-text binary image 
data to non-text gray scale image data. The scale module 
250 and the enhancement filter module 260 are optional. 
The scale module 250 reduces the resolution of the non- 
text gray scale image data. For example, if the resolution 
of the non-text gray scale image data is X dpi, where X is 
an integer, the scale module 250 may reduce the resolu- 
tion to X/2 dpi. The scaled non-text gray scale image data 
is sent to the enhanced filter module 260. The enhanced 
filter module 260 applies enhancement filters or tonal 
corrections to the scaled non-text gray scale image data 
to generate an enhanced non-text gray scale image data. 
The enhanced non-text gray scale image data becomes 
the background plane 120 of the MRC format. The en- 



hanced non-text gray scale image data is sent to the lossy 
compression module 280. The lossy compression module 
280 compresses the enhanced non-text gray scale image 
data, for example, using JPEG orJPEG2000 compression 
scheme. The compressed non-text gray scale image data 
is sent to the wrapper module 290. The wrapper module 
290 creates one or more data files containing the com- 
pressed selector plane 110 and the background plane 
120. 

[0027] The binary text segmentation module 210 may be created 
using well known run length analysis and/or well known 
connected component analysis techniques. One run length 
analysis analyzes a number of contiguous black pixels in 
the same row or column of an image, thereby determining 
whether the contiguous black pixels are text or non-text. 
Such a technique is described in U.S. Pat. No. 4,590,606 
to Rohrer; and in K. Kerchmar, "Amount Line Finding 
Logic", IBM Technical Disclosure Bulletin, Volume 15, No. 
5, pages 1531 to 1532 (October 1972). Another run 
length analysis analyzes whether a particular portion of 
the image is text or non-text graphical information based 
upon the number or density of black pixels in that region 
or line of pixels. This technique is described in Japanese 



Patent No. JP 60-77278 to Isobe et al. and Japanese Patent 
No. JP 60-116076 to Iwase. Yet another run length analy- 
sis analyzes whether each segment is text or non-text 
graphical information based upon the statistical proper- 
ties of the segment, as described in Yasuda et al., "Data 
Compression for Check Processing Machines", Proceed- 
ings of the IEEE, Volume 68, No. 7, pages 874 through 
885 (July 1980). 

[0028] Connected component analysis is a well-known technique 
that is also used to separate text from non-text in an im- 
age. Connected component analysis is used to identify in- 
dividual text characters in an image. Such techniques are 
described in U.S. Pat. No. 4,403,340 to Kumpf; and U.S. 
Pat. No. 4,045,773 to Kadota. One connected component 
analysis technique separates text from non-text matter by 
determining whether a length of connected black pixels is 
statistically close to a predetermined text line length. This 
technique is described in U.S. Pat. No. 4,513,442 to 
Scherl. 

[0029] a combination of run length analysis and connected com- 
ponent analysis may be used, for example, as described in 
Nolan, "Line/Symbol Separation for Raster Image Process- 
ing", IBM Technical Disclosure Bulletin, Volume 15, No. 12 



(May 1973), pp. 3879-3883. Nolan describes a process 
for deciding whether a given run length of contiguous 
black pixels in the image should be classified as a graphi- 
cal line to be discarded by determining whether it corre- 
sponds to a similar run length of black pixels in the pre- 
ceding scan line which was previously identified as a 
graphical or non-text line. 
[0030] The speckle removal module 220 may be created using 

well known filter techniques. It should be appreciated that 
simple image processing filters, such as low-pass, median 
or morphological filters are not adequate for removing 
speckle-type noise. These filters either distort the original 
image, or do not affect the speckle noise. The adaptive 
filters defined as a function proportional to the local in- 
formation content of an image (such as local normalized 
variance: variance/mean) can successfully suppress 
speckle-type noise, without distorting the original image. 
In this regard, the LLLMMSE filter ("Local Linear Minimum 
Mean Standard Error Filter") removes speckle over homo- 
geneous areas with speckle noise, whereas edges and 
high-contrast features are preserved, as described in 
Aiazzi, B. et al. "Multi-resolution Adaptive Speckle Filter- 
ing: A Comparison of Algorithms," Proceedings of the 



1997 IEEE International Geoscience and Remote Sensing 
Symposium, IGARSS '97, Part 2 (of 4), Singapore, Singa- 
pore; International Geoscience and Remove Sensing Sym- 
posium (IGARSS) v 2 1997, IEEE, Piscataway, N.J., USA, 
97CH36042, pp. 1054-1056 and Kodidis, Eleftherios et 
al. "Nonlinear Adaptive Filters For Speckle Suppression in 
Ultrasonic Images," Signal Processing, Signal Processing v 
52 n3 August 1996, pp. 357-372. 
[0031] The subtraction module 230 uses well known subtraction 
techniques to subtract enhanced binary text image from 
the binary image data to generate a non-text binary im- 
age data. 

[0032] pigs. 3-4 are flowcharts outlining an exemplary operation 
of the binary image data to gray scale image data con- 
verter module 240. The operation assumes that lookup 
tables to convert binary image data to gray scale image 
data are already generated. It should be appreciated that a 
set of look-up tables can be generated during a training 
operation for each of various different types of halftoning 
methods. A training operation is performed at least once 
for each type of halftoning. An example of a training op- 
eration may be found in U.S. Pat. No. 6,343,159 Bl to Cu- 
ciurean-Zapan et al., which is incorporated herein by ref- 



erence in its entirety, and will not be further discussed 
here. 

[0033] During binary image data to gray scale image data con- 
version, an appropriate look-up table is selected to con- 
vert a received binary image data into gray scale image 
data. The selection of the appropriate look-up table may 
be accomplished using any one of a variety of known or 
later developed techniques. For example, the look-up ta- 
ble could be selected based simply on a user's input or 
based on the characteristics of the output device that 
printed the binary image. Alternatively, the selection of 
the look-up table may be based on an analysis of the bi- 
nary image, such as, for example, a segmentation analysis 
performed on the input image to determine the halftoning 
method. If no information could be gathered about the in- 
put binary image, a generic look-up table may be used to 
convert the gray scale image. 

[0034] Referring now to Fig. 3, the operation starts at step S100 
and continues to step S110. In step S110, a decision is 
made whether an appropriate look-up table is available 
for the type of halftoning used in the binary image to be 
converted. If an appropriate look-up table is not available, 
operation continues to step S120. In step S120, the ap- 



propriate look-up table is generated. The look-up table 
may be generated, for example, using the teachings of 
U.S. Pat. No. 6,343,159 Bl to Cuciurean-Zapan et al. Op- 
eration then continues to step S130. Otherwise, if a look- 
up table is available, operation continues directly to step 
S130. In step S130, binary image data is received. Next, in 
step S140, the binary image data is converted to gray 
scale image data. The operation then continues to S150, 
where the operation ends. 

[0035] pig. 4 is a flowchart outlining in greater detail one exem- 
plary embodiment of converting the binary image data to 
gray scale image data of step S140 of Fig. 3. Starting at 
step S140, operation continues to step S1410. In step 
S1410, the first portion of the binary image containing 
pixels is selected as the current window to be analyzed. 
Then, in step S1420, the pixel pattern in the current win- 
dow is determined. Next, in step S1430, the target pixel 
location of the current window is determined. Operation 
then continues to step S1440. 

[0036] | n s tep S1440, the original pattern number of the pixel 
pattern observed in the current window is determined. 
Then, in step S1450, the unique pattern that is associated 
with the determined original pattern number is deter- 



mined. Next, in step S1460, the gray scale value for the 
determined unique pattern is read from the look-out ta- 
ble. Next, in step S1470, the determined gray scale value 
is assigned to the target pixel location of the current win- 
dow. Next, in step S1480, the current window of the bi- 
nary image is analyzed to determine if the current window 
is the last window. If the current window is not the last 
window of the binary image, operation continues to step 
S1490. In step S1490, the next window of the binary im- 
age is selected. Operation then returns to step S1420. 
Otherwise, if the current window is the last window of the 
binary image, operation continues to step S150. Further 
details of binary image data to gray scale image data con- 
version may be found in U.S. Pat. No. 6,343,159 Bl to Cu- 
ciurean-Zapan et al., which is incorporated herein by ref- 
erence in its entirety. 
[0037] Th e optional enhancement filter module 260 may use var- 
ious enhancement techniques to enhance the scaled non- 
text gray scale image data. Enhancement techniques may 
include luminance enhancement (e.g. U.S. Pat. No 
5,450,502), sharpness enhancement (e.g., U.S. Pat. No. 
5,363,209), exposure adjustment (e.g. U.S. Pat. No. 
5,414,538), contrast enhancement (U.S. Pat. No. 



5,581,370), etc. These processes can be used together in 
a predictive mode that does not require iterative process- 
ing (e.g. U.S. Pat. No. 5,347,374). In general, these pro- 
cessing methods operate by modifying a set of tonal re- 
production curves (TRCs). U.S. Pat. No. 6,175, 427 to 
Lehmbeck et al., which is incorporated herein by reference 
in its entirety, discloses a technique to enhance image 
data based on tonal correction on which the enhancement 
filter module 260 may be based. 

[0038] The lossless compression module 270 may be created 
based on lossless compression schemes such as gzip or 
CCITT G3/G4. Alternatively token-based compression 
schemes like JBIG2 could also be applied. The lossy com- 
pression module 280 may be created based on lossy com- 
pression schemes such as JPEG, JPEG2000, etc. Lossless or 
lossy compression schemes are well known in the art and 
will not be further discussed. 

[0039] FIG. 5 is a flowchart outlining one exemplary method of 
reformatting binary image data to MRC format. The oper- 
ation starts at step S200 and continues to step S210. In 
step S210, binary image data is received. Next, in step 
S220, binary text image data is extracted from the binary 
image data. Then, in step S230, speckle noise is removed 



from the binary text image data to generate an enhanced 
binary text image data, which becomes the selector plane 
of the MRC format. Next, in step S240, the enhanced bi- 
nary text image data is compressed based on a lossless or 
token-based compression scheme. The operation contin- 
ues to step S3 10. 
[0040] | n parallel with step S240, in step S250, the enhanced bi- 
nary text image is subtracted from the binary image data 
to generate a non-text binary image data. Next, in step 
S260, the non-text binary image data is converted to 
non-text gray scale image data. The non-text gray scale 
image data becomes the background plane of the MRC 
format. In step S270, the non-text gray scale image data 
is scaled. Next, in step S280, a decision is made whether 
to enhance the non-text gray scale image data. If a deci- 
sion is made to enhance the non-text gray scale image 
data, the operation continues to step S290. In step S290, 
enhancement is performed on the scaled non-text gray 
scale image data to generate an enhanced non-text gray 
scale image data. The operation then continues to step 
S300. Otherwise, if a decision is made not to enhance the 
non-text gray scale image data, the operation continues 
to step S300. In step S300, the non-text gray scale image 



data/enhanced non-text gray scale image data is com- 
pressed using a lossy compression scheme. The operation 
then continues to step S310, where one or more files are 
created that contain the compressed selector plane and 
the background plane. 

[0041] pig. 6 illustrates another binary image data to MRC format 
generating system 300 in accordance with an embodiment 
of the invention. The binary image data to MRC format 
generating system 300 includes a binary image data to 
gray scale image data converter module 310, a gray scale 
image data segmentor 320, an optional scale module 330, 
an optional enhancement filter module 340, a lossless 
compression module 350, a lossy compression module 
360 and a wrapper module 370. 

[0042] The binary image data to gray scale image data converter 
module 310 receives binary image data from a data 
source. The binary image data to gray scale image data 
converter module 310 converts the binary image data to 
gray scale image data. The gray scale image data segmen- 
tor 320 segments the gray scale image into high spatial 
frequency image data and low spatial frequency image 
data. The high spatial frequency image data becomes the 
selector plane 110'. The low spatial frequency image data 



becomes the background plane 120'. The high spatial fre- 
quency image data is sent to the lossless compression 
module 350. The lossless or token-based compression 
module 350 compresses the high spatial frequency image 
data, for example, using a CCITT G3/G4 or JBIG2 com- 
pression scheme. The compressed high spatial frequency 
image data is sent to the wrapper module 370. 
[0043] The low spatial frequency image data is sent to the scale 
module 330. The scale module 330 and the enhancement 
filter module 340 are optional. The scale module 330 re- 
duces the resolution of the low spatial frequency image 
data. The scaled low spatial frequency image data is sent 
to the enhancement filter module 340. The enhancement 
filter module 340 performs tonal correction to the scaled 
low spatial frequency image data. The enhanced low spa- 
tial frequency image data is sent to the lossy compression 
module 360. The lossy compression module 360 com- 
presses the enhanced low spatial frequency image data to 
generate a compressed enhanced low spatial frequency 
image data, which is sent to the wrapper module 370. The 
wrapper module 370 creates one or more files which con- 
tain the compressed high spatial image data and the com- 
pressed enhanced low spatial frequency image data. 



[0044] The binary image data to gray scale image data converter 
module 310, the scale module 330, the enhancement fil- 
ter module 340, the lossless compression module 350 
and the lossy compression module 360 may be similar to 
the binary image data to gray scale image data converter 
module 240, the scale module 250, the enhancement fil- 
ter module 260, the lossless compression module 270 
and the lossy compression module 280 of Fig. 2, respec- 
tively, and will not be discussed further. 

[0045] pigs. 7-9 are flowcharts outlining an exemplary operation 
of the gray scale image data segmentor 320. A gray scale 
image data may be divided into a plurality of blocks for 
processing efficiency. Referring now to Fig. 7, the opera- 
tion starts at step S400 and continues to step S410. In 
step S410, a block is received. Then, in step S420, the 
block is initially classified as either UNIFORM, SMOOTH, 
WEAK.EDGE or EDGE, and its context as either TEXT or 
NON-TEXT. The block is then be reclassified as either 
SMOOTH or EDGE, depending upon the initial classifica- 
tion and the context. Next, in step S430, pixels in the 
block are segmented and placed on either the selector 
plane 110' or the background plane 120' based on the 
classification of the block. The operation continues to step 



S440. In step S440, a decision is made whether there are 
any more blocks to be processed. If there are more blocks 
to be processed, then the operation continues to step 
S410. Otherwise, if all the blocks have been processed, 
the operation continues to step S450 where the operation 
ends. 

[0046] pig. 8 is a more detailed flowchart of step S420 of Fig. 7. 
As described above, a block is classified into one of four 
categories. Referring to Fig. 8, the operation starts at step 
S4200 where variation of pixel values within the block is 
determined. Block variation may be determined by using 
statistical measures. Further details of statistical measures 
may be found in U.S. Pat. 6,400,844 Bl to Fan et al. which 
is incorporated herein by reference. Blocks with large vari- 
ations throughout are most likely to lie along edges of the 
image, while those containing little variations probably lie 
in uniform or at least smooth areas. Next, in step S4210, 
the block is classified based on the amount of variation. 
Measuring the variations within the block allows an initial 
classification to be assigned to the block. Then, in step 
S4220, image data within each block is reviewed in detail 
to allow context information (i.e. whether the region is in 
the text or non-text region of the image) to be updated 



and any necessary block re-classifications to be per- 
formed. The UNIFORM blocks are reclassified as SMOOTH, 
and the WEAK EDGE blocks are upgraded to EDGE in a 
TEXT context or reclassified as SMOOTH in a NON-TEXT 
context. 

[0047] pig. 9 is a more detailed flowchart of step S430 of Fig. 7. 
Fig. 9 describes the manner in which a block is segmented 
into two planes. The operation starts at step S4300 where 
measurement begins by first determining whether the 
block being processed has initially been classified as an 
EDGE. If so, the values vp of each pixel in the block are 
first compared to a brightness threshold value ts, wherein 
pixels that have values equal to or above ts are viewed as 
"bright" pixels, while those with values below ts are "dark" 
pixels. An EDGE block is segmented by placing dark pixels 
on the selector plane 110' (step 4330) and placing bright 
pixels on the background plane 120' (step 4340). 

[0048] |f i t j S determined at step 4300 that the block is not an 
EDGE block, then the operation continues to step S4320. 
In step S4320, all pixels in the block are processed to- 
gether, rather than on a pixel by pixel basis. Segmenting 
of SMOOTH (non-EDGE) pixels occurs as follows: if the 
block is in the midst of a short run of blocks that have 



been classified as SMOOTH, and further, all blocks in this 
short run are dark (v<t), then all pixels in the block is 
placed on the selector plane 110'. If the entire block is 
substantially smooth (i.e. in a long run) or is bright (in a 
short run of bright pixels), then all pixels in the block are 
placed on the background plane 120'. Further details of 
segmenting gray scale image data into text image data 
and non-text gray scale image data may be found in U.S. 
Pat. 6,400,844 Bl to Fan et al., which is incorporated 
herein by reference in its entirety. 
[0049] pig. 10 is a flowchart outlining the another exemplary 

method of reformatting binary image data to MRC format. 
The operation starts at step S500 and continues to step 
S510. In step S510, binary image data is received. Next, in 
step S520, binary image data is converted to gray scale 
image data. Then, in step S530, the gray scale image data 
is segmented into high spatial frequency gray scale image 
data and low spatial frequency gray scale image data. The 
high spatial frequency gray scale image data becomes the 
selector plane 110'. The low spatial frequency gray scale 
image data becomes the background plane 120'. Next, in 
step S540, the high spatial frequency gray scale image 
data is compressed using a lossless or token-based com- 



pression scheme. The operation continues to step S590. 
[0050] | n parallel, in step S550, the low spatial frequency gray 
scale image data is scaled. Then, in step S560, a decision 
is made whether to enhance the low spatial frequency 
gray scale image data. If a decision is made to enhance 
the low spatial frequency gray scale image data, the oper- 
ation continues to step S570. In step S570, enhancement 
is performed on the scaled low spatial frequency gray 
scale image data to generate an enhanced low spatial fre- 
quency gray scale image data. The operation then contin- 
ues to step S580. Otherwise, if a decision is made not to 
enhance the low spatial frequency gray scale image data, 
the operation continues to step S580. In step S580, the 
low spatial frequency gray scale image data/enhanced low 
spatial frequency gray scale image data is compressed us- 
ing a lossy compression scheme. The operation then con- 
tinues to step S590, where one or more files are created 
containing the compressed selector plane and the back- 
ground plane. 

[0051] it should be understood that various embodiments have 
been described based on reformatting binary images to 
2-layer MRC files containing one selector and one back- 
ground plane. However, it should be appreciated that re- 



formatting binary images includes multi-mask MRC con- 
taining multiple mask planes for the different "gray" text 
in a page and a single background plane. 
[0052] | t should be appreciated that, in various exemplary em- 
bodiments, the binary image data to MRC format generat- 
ing system of Figs. 2 and 6 can be implemented as soft- 
ware executing on a programmed general purpose com- 
puter. Likewise, the binary image data to MRC format 
generating system of Figs. 2 and 6 can also be imple- 
mented on a special purpose computer, a programmed 
microprocessor or microcontroller and peripheral inte- 
grated circuit elements, and ASIC or other integrated cir- 
cuit, a digital signal processor (DSP), a hardwired elec- 
tronic or logic circuit, such as a discrete element circuit, a 
programmable logic device, such as a PLD, PLA, FPGA or 
PAL, or the like. Each of the various signal lines as shown 
in Figs. 2 and 6 connecting the various modules and the 
like can be direct signal line connections or can be soft- 
ware connections implemented using an implication pro- 
gramming interface or the like. It should be appreciated 
that any appropriate hardware, firmware or software ele- 
ments or data structures can be used to implement one or 
more of the various modules and/or signal lines outlined 



above with respect to Figs. 2 and 6. 
[0053] it should be understood that each of the modules which 
can also be circuits, routines, applications or the like can 
be implemented as software that is stored on a computer- 
readable medium and that is executable on a programmed 
general purpose computer, a programmed special pur- 
pose computer, a programmed microprocessor, a pro- 
grammed digital signal processor or the like. Such a com- 
puter-readable medium includes using a carrier wave or 
the like to provide the software instructions to a process- 
ing device. It should also be understood that each of the 
modules which can be circuits, routines, applications, ob- 
jects, procedures, managers and the like can be imple- 
mented as portions of a suitably programmed general 
purpose computer. Alternatively, each of the circuits, rou- 
tines, applications, objects, procedures, managers and/or 
modules can be implemented as physically distinct hard- 
ware circuits within an ASIC, using a digital signal proces- 
sor DSP, using an FPGA, a PLD, a PLA and/or a PAL, or us- 
ing discrete logic elements or discrete circuit elements. 
The particular form the circuits, routines, applications, 
objects, procedures, managers and/or modules will take 
is a design choice and it will be obvious and predictable to 



those skilled in the art. It should be appreciated that the 
circuits, routines, applications, objects, procedures, man- 
agers and/or modules do not need to be of the same de- 
sign. 

[0054] | t should be appreciated that a routine, an application, a 
manager, a procedure, an object, and/or a module, or the 
like, can be implemented as a self-consistent sequence of 
computerized steps that lead to a desired result. These 
steps can be defined by and/or in one or more computer 
instructions stored in a computer-readable medium, 
which should be understood to encompass using a carrier 
wave or the like to provide the software instructions to a 
processing device. These steps can be performed by a 
computer executing the instructions that define the steps. 
Thus, the terms "routine", "application", "manager", "pro- 
cedure", "object" and/or "module" can refer to, for exam- 
ple, any appropriately-designed circuit, a sequence of in- 
structions, a sequence of instructions organized with any 
programmed procedure or programmed function, and/or 
a sequence of instructions organized within programmed 
processes executing in one or more computers. Such rou- 
tines, applications, managers, procedures, objects and/or 
modules, or the like, can also be implemented directly in 



circuitry that performs a procedure. Further, the data pro- 
cessing can be performed by a computer executing one or 
more appropriate programs, by special purpose hardware 
designed to perform the method, or any combination of 
such hardware, firmware and software elements. 
[0055] while the invention has been described in conjunction 

with various exemplary embodiments, these embodiments 
should be viewed as illustrative, not limiting. Various 
modifications, substitutes, or the like are possible within 
the spirit and scope of the invention. 



